TWITTIRÒ: a Social Media Corpus with a Multi-layered Annotation for Irony

Authors

  • Alessandra Teresa Cignarella
  • Cristina Bosco
  • Viviana Patti
Abstract

English. In this paper we describe our work on applying a multi-layered scheme for the fine-grained annotation of irony (Karoui et al., 2017) to a new Italian social media corpus. In applying the annotation to this corpus of tweets, i.e. TWITTIRÒ, we outlined both the strengths and the weaknesses of the scheme when applied to Italian, thus clarifying the future directions that can be followed from a multilingual and cross-language perspective.

Italiano. In questo articolo descriviamo la creazione di un corpus di testi estratti da social media in italiano e l’applicazione ad esso di uno schema multilivello per l’annotazione a grana fine dell’ironia sviluppato in (Karoui et al., 2017). Nell’applicare l’annotazione a questo corpus composto da messaggi di Twitter, i.e. TWITTIRÒ, abbiamo discusso i punti di forza ed i limiti dello schema stesso, in modo da evidenziare le direzioni da seguire in futuro anche in prospettiva multilingue e cross linguistica.

Similar resources

Exploring the Impact of Pragmatic Phenomena on Irony Detection in Tweets: A Multilingual Corpus Study

This paper provides a linguistic and pragmatic analysis of irony in order to show how Twitter users exploit irony devices within their communication strategies for generating textual content. We aim to measure the impact of a wide range of pragmatic phenomena on the interpretation of irony, and to investigate how these phenomena interact with contexts local to the twee...

4th International Workshop on Corpora for Research on Emotion Sentiment & Social Signals (ES³ 2012)

In this paper we describe our current work on Senti–TUT, a novel Italian corpus for sentiment analysis. This resource includes annotations of both sentiment and morpho-syntax, so as to open up several possibilities for further use in sentiment analysis. As for the annotation at the sentiment level, we focus on irony and therefore selected texts on pol...

A Multi-View Sentiment Corpus

Sentiment Analysis is a broad task that involves the analysis of various aspects of natural language text. However, most state-of-the-art approaches investigate each aspect independently, i.e. Subjectivity Classification, Sentiment Polarity Classification, Emotion Recognition, Irony Detection. In this paper we present a Multi-View Sentiment Corpus (MVSC), which comprise...

Exploring the Realization of Irony in Twitter Data

Cynthia Van Hee, Els Lefever and Véronique Hoste. LT3, Language and Translation Technology Team, Ghent University, Groot-Brittanniëlaan 45, 9000 Ghent, Belgium. Abstract: Handling figurative language like irony is currently a challenging task in natural language processing. Since irony is commonly used in user-generated content, its presence can ...

Linguistic-based Patterns for Figurative Language Processing: The Case of Humor Recognition and Irony Detection

Figurative language represents one of the most difficult tasks regarding natural language processing. Unlike literal language, figurative language takes advantage of linguistic devices such as irony, humor, sarcasm, metaphor, analogy, and so on, in order to communicate indirect meanings which, usually, are not interpretable by simply decoding syntactic or semantic information. Rather, figurativ...

Publication date: 2017